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Abstract. Motivated by the notion of symmetric group spectral analysis developed by Diaconis, we in- 
troduce the notion of spectral analysis on the rook monoid (also called the symmetric inverse semigroup), 
characterize its output in terms of symmetric group spectral analysis, and provide an application to the sta- 
tistical analysis of partially ranked (voting) data. We also discuss generalizations to arbitrary finite inverse 
semigroups. This paper marks the first non-group semigroup development of spectral analysis. 



1. Introduction 



Spectral analysis is a model-free, symmetry-based approach to the statistical exploration and description 
of datasets. In !6 , P. Diaconis gives a method for the spectral analysis of partially ranked voting data using 
symmetric groups. Briefly, suppose an n-candidate election is being held and each voter is asked to rank as 

■ many candidates as he wishes in order of preference, from position 1 (best) to position k (worst among the 
I candidates the voter chooses to rank). His technique begins by viewing this collection of partial rankings as 

■ elements of certain symmetric group modules and taking orthogonal projections of these elements onto the 
\ minimal invariant subspaces of these modules. Such projections give a complete, non-redundant description 

of the dataset. To provide easily understood statistics, inner products of these projections with certain 
"easily interpretable" functions are computed. The resulting statistics constitute the spectral analysis (or 
the symmetric group spectral analysis) of the dataset. His approach was the first non-abelian finite group 
\ generalization of the usual Fourier (or spectral) analytic techniques of time series, based on the abelian group 

0\'. 

, While the symmetric group acts naturally on the set of all partial rankings of n objects, the rook monoid 

\^ ■ Rn is this set of partial rankings. In this paper we explain the algebra of the rook monoid and generalize 

lO I the notion of symmetric group spectral analysis to this new setting. Complications arise because the rook 

monoid is not a group, and while rook monoid modules decompose into minimal invariant subspaces, they 
are not necessarily orthogonal under the natural inner product. We define the notion of rook monoid spectral 
analysis, resolve these complications, and give a complete description of its output in terms of symmetric 
group spectral analysis. 

The main contributions of the present work are the following. First, the algebra Ci?„ provides a natural 
framework for the encoding and analysis of partially ranked data for all partial rankings, not just rankings 
of objects in positions 1 through k. Second, we define and analyze two approaches to rook monoid spectral 
I analysis. Under the groupoid basis association (defined in Section |4.2|) . we show in Theorem 14.41 that rook 

monoid spectral analysis offers a more local, granular approach to the statistical analysis of partially ranked 
(but not fully ranked) data than symmetric group spectral analysis does, in that it amounts to the partitioning 
of a dataset by rank, domain, and range, followed by symmetric group spectral analysis using appropriately- 
sized symmetric groups on each piece of the partition. We reinforce this with an example in Section 14.51 
Under the semigroup basis association (defined in Section [3|), we show in Theorem 14.61 that rook monoid 
spectral analysis offers a hierarchical approach to the statistical analysis of partially ranked data, in that 
it amounts to, for each pair of subsets D and R of {1,2, . . . , n} such that \D\ = \R\, a symmetric group 
spectral analysis, using an appropriately-sized symmetric group, of the partial rankings in the dataset whose 
domains extend D and whose ranges extend R. Finally, we discuss generalizations to other semigroups. 

We proceed as follows. In Section [2] we review basic facts about partial rankings, the rook monoid, and 
inverse semigroups. In Section [3] we use basic ideas from the representation theory of inverse semigroups 
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to define the Fourier transform on a finite inverse semigroup and we explain how this definition gives rise 
to two important examples of group-based spectral analysis — time series and the symmetric group spectral 
analysis of Diaconis. 

In Section 2] we extend group-based spectral analysis to the rook monoid and we discuss extensions to 
inverse semigroups in general. In Section l4.1l we discuss how to perform symmetric group spectral analysis 
using appropriately-sized symmetric groups on each part of the partition (by rank, domain, and range) of a 
dataset consisting of partial rankings and we organize the goals for the rest of the paper. In Section |4?2] we 
review the groupoid basis of Ci?„ and we define the groupoid basis association. In Section l473l we describe 
an inner product under which the isotypic subspaces of CRn are mutually orthogonal, we describe easily 
interpretable functions for partially ranked data, and we describe the isotypic subspaces of CRn in terms of 
the natural statistical information they carry. We develop rook monoid spectral analysis under the groupoid 
basis association and characterize the statistics it generates in Section 14.41 and we look at an example in 
Section 14.51 We then consider rook monoid spectral analysis under the semigroup basis association and 
characterize the statistics it generates in Section 14.61 In Section 14.71 we look at what happens if we try to 
use the natural inner product instead of the inner product introduced in Section 14.31 Section [5] contains 
thoughts on directions for future research. 

Our development relies on several results from the representation theory of inverse semigroups, which 
are included in appendices. In Appendix]^ we review the basic definitions from the representation theory 
of inverse semigroups. In Appendix [B] we review results of B. Steinberg [25] on the groupoid basis of an 
inverse semigroup algebra and the decomposition of an inverse semigroup algebra into a direct sum of matrix 
algebras over group algebras. Our results in Section l473l and our proofs of Theorems 14.41 and 14.61 are based 
on theorems for inverse semigroup algebras in general, which we state and prove in Appendix [Cl 

2. Partial rankings 

Suppose a five-candidate election is being held and you, as a voter, are asked to rank as many of these five 
candidates as you wish in any positions. The candidates are labeled 1 through 5. If you prefer candidate 4 
in first position, candidate 5 in second, candidate 1 in third, candidate 3 in fourth, and candidate 2 in fifth, 
your vote would be 

/ 1 2 3 4 5 \ 
^^^1^3 5 4 1 2 ) ' 

a permutation on {1,2,3,4,5}. We will write our permutations with the domain on the top row and 
corresponding images on the bottom row, so here, for example, 7r(3) = 4. If, on the other hand, you wished 
to express the same preference as above for candidates 2, 4, and 5, without ranking candidates 1 and 3, your 
vote would be 

f 1 2 3 4 5 \ 

^= - 5 - 1 2 j' 

a partial ranking on {1, 2, 3, 4, 5}. The dashes in the second row indicate that 1 and 3 are not in the domain 
of cr. The ranking a indicates a preference for candidates 4 and 5 and a strong distaste for candidate 2, 
without committing to a ranking of the intermediate candidates 1 and 3. 

Definition 2.1. A partial ranking a on {1, 2, . . . , n} is an injective partial function from {1, 2, . . . , n} to 
{1, 2, . . . , ri}. The domain of a, denoted dom(fT), is the set of elements k G {l,2,...,n} for which cr(fc) 
is defined. The range of cr, denoted ran((7), is {cr(fc) : k G dom(f7)}. The rank of cr, denoted rk(CT), is 
dom(cr)| = |ran(cr)|. 

For the choice of a above, we have doni(cr) = {2,4,5}, ran(a) = {1,2,5}, and rk(cr) = 3. We use the 
usual operation of partial function composition and we adopt the convention that maps act on the left of 
sets and are composed right-to- left: If cr, 7 are partial rankings on {l,2,...,n}, then cr o 7 is the partial 
ranking on {1, 2, . . . , n} whose domain is the set of elements k for which k G dom(7) and 7(fc) G dom(CT), 
and if fc G dom(CT o 7), then {a o 7)(fc) — cr(7(fc)). 

A semigroup is a nonempty set with an associative binary operation. A monoid is a semigroup with an 
identity element. Unless otherwise specified, we will write our semigroup operations multiplicatively. 

Definition 2.2. For an integer n > 0, the rook monoid Rn is the set of all partial rankings on {1, 2, . . . , n} 
under the operation of partial function composition. 
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It is easy to check that i?„ is a monoid. We denote the symmetric group on {1,2, ... ,n} by S'„. Rn 
contains Sn as the set of elements of rank n, and the identity element for the operation on i?„ is the identity 
of Sn. In fact, Rn contains isomorphic copies of all Sk, for < A: < n, which are obtained by identifying Sk 
with the set of elements of i?„ whose domain and range are both {1, 2, . . . , k}. 

Definition 2.3. If S and T are semigroups, then a semigroup homomorphism (or just homomorphism) (j) 
from 5 to T is a map (j) : S ^ T such that (f>(ab) = (f>{a)(j){b) for all a, 6 G 5. A semigroup isomorphism (or 
just isomorphism) is a semigroup homomorphism that is one-to-one and onto. 

Another way to view i?„ is as the set of all n x n matrices that have at most one 1 in each row and 
column (their other entries being 0), under the usual operation of matrix multiplication. Such matrices are 
called rook matrices. Given a partial ranking a G Rn, we can create an n x n rook matrix by placing a 1 in 
the i,j position whenever (t(j) — i and having all other entries be 0. It is clear that this association is an 
isomorphism, and furthermore that the rank of a partial ranking a is the same as the rank of its associated 
rook matrix. i?„ is called the rook monoid because the collection oi n x n rook matrices corresponds to the 
set of possible placements of non-attacking rooks on an n x n chessboard. 

Although Rn is not a group (unless n = 0, in which case _Ro — ^i), Rn does have a nice algebraic 
structure — that of an inverse semigroup [4] . 

Definition 2.4. An inverse semigroup is a semigroup S with the property that, for each x G S, there exists 
a unique y G 5 such that xyx — x and yxy — y. In this case, y is said to be the inverse of x, and we write 
x"^ = y- 

It follows that, in an inverse semigroup, if a;^^ = y then y^^ — x, xx~^ is idempotent, and if e is idempotent 
then = e. Every group is an inverse semigroup, but not conversely. Also, we have emphasized the word 
unique in this definition, as uniqueness of an element's inverse does not follow from the rest of the hypotheses 
as it does for groups. For example, for n > 2, in T„, the full transformation semigroup on n elements (the 
set of all functions from {1, 2, . . . ,n} to {1, 2, . . . ,n} under function composition), for each element x there 
is at least one y such that xyx = x and yxy = y, and there exist elements x for which there are multiple 
elements y satisfying both equations. 

It is easy to see that the inverse of an element a G Rn is the partial ranking 7 whose domain is ran(CT), 
and whose definition (informally) is given by sending everything in ran((T) back where it came from. Viewing 
the elements of Rn as rook matrices, the inverse of a rook matrix is its transpose. 

3. Representations and spectral analysis 

Our development of spectral analysis depends on the representation theory of inverse semigroups. The 
basic definitions are similar to those for groups, and are included in Appendix |X] for the convenience of the 
reader. Let S* be a finite inverse semigroup and let CS denote the complex algebra of S. 

Definition 3.1. The natural basis of CS, i.e., the basis {sj^gs, is called the semigroup basis of CS. 

Elements of CS can be identified with complex-valued functions on S* in a natural way. Specifically, if 
f : S C, then / corresponds to the element X^sss fi^)^ ^ '^^ '^^^ therefore be seen as the algebra 
of complex-valued functions on S. This association between functions on S and elements of CS is called the 
semigroup basis association. There is, for non-group inverse semigroups in general, a different natural basis 
of CS and therefore another natural way to associate functions on S and elements of CS*, called the groupoid 
basis association, which we define in Section 14.21 

CS is semisimple. When S* is a group, this is Maschke's theorem For general S, this is a result of 
Munn 1181 Theorem 4.4]. Since CS is semisimple, Wedderburn's theorem applies to CS*. Semisimplicity and 
the Wedderburn isomorphism are the key ingredients for the spectral analysis we develop. 

If / G CS and p is a matrix representation of CS*, denote p{f) by f{p). 

Theorem 3.2 (Wedderburn's theorem). Let y be a complete set of inequivalent, irreducible matrix repre- 
sentations of CS . Then y is finite, and the map 



(1) 
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is an isomorphism of algebras. Explicitly, if f E CS , with f = X^seS then 

pey peyses 

in this isomorphism. 

Definition 3.3. Given / G CS and a complete set of inequivalent, irreducible matrix representations y of 
CS, the Fourier transform of f according to y (or just the Fourier transform of /) is the image of / in the 
Wedderburn isomorphism p]). 

Definition 3.4. The inverse image of the natural basis of the algebra on the right in the Wedderburn 
isomorphism ([ij (that is, the set of matrices in this algebra which have a 1 in one position and in all other 
positions) is called the Fourier basis of CS according to y. 

Thus the Fourier transform of / is, in general, a block diagonal matrix with complex entries, and we can 
view the Fourier transform of / according to 3^ as a change of basis within CS*, from the natural basis {s}sg5 
of CS, to the Fourier basis of CS according to 3^. 

Fourier transforms are closely related to the notions of spectral analysis. We begin by seeing how these 
notions apply to time series. 

Example 3.5 (Time series). Let S — (Z„, +) = {0, 1, . . . , n — 1}, the cyclic group of order n. The irreducible 
representations of CZ„ are all one-dimensional — they are the characters Xk for k G {0, . . . , n — 1}, defined 
on the natural basis of CZ„ by Xk{t) = e^''*'^*/". If / : Z„ C and we view / e CZ„ as / = Y.'t=o /(O^: 
then 

ri-l 
t=0 

the familiar discrete Fourier transform of /. The Fourier basis of CZ„ is the usual basis of sampled expo- 
nentials {6fc})!ZQ C CZ„: 

-. n— 1 

n ^-^ 

t=o 

We now explain how things generalize beyond = Z„ . We can often view a dataset as an element of some 
CS'-module for some finite inverse semigroup S. Let 5 be a finite inverse semigroup and let / be a dataset, 
viewed in some way as an element of some left CS'-module M . Since CS is semisimple, M decomposes into 
a direct sum of irreducible CS-submodules Mi : 

M = Mi®M2©---©Mfe. 

Unfortunately, the Mi are not uniquely determined in general. For any irreducible submodule N of M, 
whether or not N appears as a direct summand in this particular decomposition, let Vn denote the sum 
of all irreducible submodules of M isomorphic to N. Vn is called the isotypic component of M of type 
(or isomorphism class) N. As N ranges across the irreducible submodules of M, we obtain the isotypic 
components Vn of M. They are uniquely determined, and M decomposes as the direct sum of them. 
Furthermore, given any decomposition of AI into irreducibles M — Mi ® M2 © • • • ® M^,, if we group the 
Mi according to their isomorphism classes and sum together the Mi from each isomorphism class, then we 
obtain the isotypic components of M [2]. Let 

M = ® F2 © • • • © 

be the decomposition of M into its isotypic components Vi, V2, . . . , Vm- This decomposition is more crude, 
in general, than a decomposition of M into irreducibles, but it has the advantage of being a unique decom- 
position of M into invariant subspaces under the action of CS. In fact, it is the finest unique decomposition 
of M into invariant subspaces under the action of CS", in the sense that attempting to decompose any Vi 
further into invariant subspaces requires a choice of basis. We do not want our definition of spectral analysis 
to depend on an arbitrary choice such as this, so it is the isotypic decomposition that we will work with. 
Loosely speaking, the spectral analysis of f £ M is the examination of the projections of / onto the isotypic 
components Vi C M. We call these projections the isotypic projections of f. 
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In the case of time series, CZ„ decomposes into a sum of n one-dimensional isotypic components — with 
notation as in Example I3.5[ let Vk — C-span(6fc) — C-span(^"ZQ'^ e~^'^*'^*/"<) C CZ„. Then we have the 
isotypic decomposition CZ„ = ®fc=o ^fc' ^^"^ spectral analysis of / G CZ„ amounts to an examination of the 
projections of / onto the Vk- In contrast to time series, however, many of the Vi may be multidimensional in 
general, and to make the notion of spectral analysis precise for a given semigroup S we will need a method 
to extract information from these projections. How exactly we should do this depends on the particular 
semigroup under consideration. We will explain this for the symmetric group in Example 13.61 for the 
rook monoid in Sections 14.31 and 14.41 

The most important CS'-module is M — CS itself (where the action of CS on M — CS is given by 
the multiplication of CS"), where the isotypic decomposition of M can be obtained from the Wedderburn 
isomorphism ([T]). CS" is both a left and right CS'-module. Notice that the inverse image of a column 
(respectively, row) of the p block of the algebra on the right in ([T|) is an irreducible left (respectively, right) 
submodule of CS* of isomorphism class p. The inverse image of the p block in ([T|) is thus the isotypic 
component of M of isomorphism class p, and is also a minimal two-sided ideal of CS. Hence the isotypic 
decomposition of CS is the same as the (unique) decomposition of CS" into the direct sum of its minimal 
two-sided ideals. 

Isotypic projections in CS are easy to compute from Fourier transforms. Let p be an irreducible matrix 
representation of CS and let y be any set of inequivalent, irreducible matrix representations of CS". Let 
7 G 3^ denote the representation in y equivalent (if not equal) to p, and denote the isotypic component of 
CS" of type p hy Vp. To compute the isotypic projection of / S CS onto Vp, take the Fourier transform of 
/ according to y, set all coefficients of the result equal to except for the ones in the 7 block, and take 
the inverse image of that. The result is the isotypic projection of / onto Vp. It is easy to see that this 
works regardless of the particular matrix representations chosen for y. Computationally efficient methods 
for computing Fourier transforms and their inverses on a wide variety of groups and semigroups have been 
developed. See, for example, [ll[3l[5l[lTl[l3[T3[Tl|T5l[21]. 

Example 3.6 (Symmetric group spectral analysis). This example is an exposition of the ideas of Diaconis 
[6]. We explain his ideas from an algebraic standpoint that will be useful for us when we generalize to the 
rook monoid in Section |4l It can be shown that our development here is equivalent (in the sense that it 
generates the same statistics for any partially ranked voting dataset on any number of candidates) to his. We 
review only the algebraic aspects that generate the statistics. For full discussion, including a large example 
and inferential issues, see [Hj. 

First we explain his technique as applied to fully ranked votes. A collection of votes in which every voter 
ranks each of n candidates in order of preference defines a C- valued (actually, a Z- valued) function on Sn, 
where /(fi) is the number of voters casting a ballot of type a. Let / : Sn — >■ C and view / as an element of 
CSn as / = X^ctgs fi'^)'^- There is a well-known bijection between the irreducible representations of CSn 
and the partitions of n [10 , so we write 

Ahn 

where V'^ is the isotypic subspace corresponding to the irreducible representation for the partition A. The 
irreducible representation for A is commonly described in terms of the action of Sn on tableaux of shape A — 
see, e.g., [10]. What we really need for spectral analysis are combinatorial descriptions of the V'^ themselves. 
The descriptions we give below are due to Diaconis [5], and will allow us to describe the natural statistical 
information each isotypic subspace carries. 

There is a natural inner product on CSn given by 

if, 9) = ( E /(^)^' E ai'^y) = E 

Under this inner product, the isotypic subspaces of CSn are mutually orthogonal [HI Chapter 2]. We now 
project / e CSn onto each subspace. That is, we write / = X^Ahn f^^ ^'^^ unique elements G V^. These 
projections may be computed by running a (fast) Fourier transform on Sn, provided n is not too large 
[3j[T3]. Other projection formulas are also available. See, for example, [HJ Theorem 1], |Z|, and [221 Theorem 
8]. 
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Next, we examine the projections /'*'. This is analogous to examining the component frequencies of a 
function in the S* = Z„ case. However, in our case, many of the are multidimensional, and in addition 
to concrete descriptions of these spaces we will use an additional device (which Diaconis attributes to C. 
Mallows) to extract information from the projections onto these spaces. 

We also note that, in a similar fashion to how the frequencies of highest amplitude carry the most 
information about the structure of a continuous waveform, here the lengths of the projections are important 
in determining which projections carry the most information about the structure of a dataset. However, due 
to the differences in dimensionality between the isotypic subspaces involved here, it is sometimes appropriate 
in making this determination to weight the lengths of the projections based on the dimensions of the subspaces 
in which they reside — see [6] for more details. By orthogonality of isotypic subspaces, we have 

(2) ii/iP = (/,/) = E(/''/')-Eii/'ii'' 

Ahn Ahn 

which allows one to compute and compare easily the lengths of the projections. 

First, V^(") is the space of constant functions on the fully ranked votes. It is one-dimensional, and 



/ 



(«) 



Vo-6S„ / \' (res,, y 



where the quantity on the right is the Fourier basis element (for any complete set of inequivalent, irreducible 
representations of CSn) lying in V^'^\ The projection Z*^"^ therefore records the number of votes cast. 
Next, there are easily interpretable (or just interpretable) first-order functions. They are of the form 



where 



Jl ifa(z)=j, 
1 otherwise, 

as i and j range over {1, 2, . . . , n}. is an (ri — 1) ■^-dimensional space. A general element of 

has the form 

i-.j 

where, since 

ig orthogonal to = 0. carries the "pure" first-order statistics 

for fully ranked votes (i.e., the first-order information about the data once the average — the zeroth-order 
information — has been removed by the projection onto y^"^). The device of Mallows used by Diaconis for 
extracting information from /("^^'i) is this [6l Section 2C]: for each i,j G {1, 2, . . . , n}, examine the inner 
product of with Si^j. It turns out that 

where 

w{a) 



n-1 



if a{i) = j, 
— otherwise. 

V n 

Next, just as there are easily interpretable first-order functions, there are also easily interpretable second- 
order (ordered and unordered) functions. The easily interpretable second-order unordered functions are 
the 

hH,l2}^{jlj2} ^ E '^{H,»2}^{J1J2}(^)^' 

where 

1 if {CT(ii),cr(i2)} = {jl, j2}, 
otherwise. 



The representation theory of CSn implies that every element of F*^" is a linear combination of the 
i5{ij^_i2}^{jj which is orthogonal to the other isotypic subspaces. We denote the easily interpretable 
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second-order ordered functions (defined analogously) by di^^^j^^^i^^j^, and elements of F^""^'^'^) are linear 
combinations of such which are orthogonal to the other isotypic subspaces. There are also easily interpretable 
third-order functions and so on. As before, we compute the inner products of with the S{ii,i2}^{ji,j2} 

and of /("-i^i'i) with the Si^^^j^^i^t^j^ to obtain second-order statistics of /. 

In a similar fashion, we can continue as far as we'd like with the remaining isotypic subspaces to extract 
third-order and higher-order statistics about /. 

Definition 3.7. The statistics created by projecting a data vector / G CS'„ onto the isotypic subspaces 
of CS'n and computing the inner products of these projections with the easily interpretable functions as 
described above constitute the symmetric group spectral analysis of /. 

Next we define Diaconis's notion of symmetric group spectral analysis for partially ranked votes. Let 
k < n and suppose we are interested in analyzing the set of votes in an election with n candidates in which 
every voter ranks their top k candidates in order of preference. The collection of such votes defines a function 
/ on the rank-fc elements of Rn of range {1,2,..., k}, where /(u) is the number of voters who prefer the 
partial ranking a. For each clement a of i?„ of range {1, 2, . . . , fc}, form the following element of CS'„: 

^ ' teSn-t>cr 

where t > a simply means that t extends cr as a partial function, and E{a) is the number of elements t G S'„ 
that extend a. Next, form the following element of CS'„: 

(TGfl„:ran(cr) = {l,2,...,fc} 

Finally, compute the symmetric group spectral analysis of F. If a dataset of partial rankings contains data 
consisting of multiple ranks, then the analysis begins by separating the data according to rank and then 
proceeds separately, rank-by-rank, generating a different set of statistics for the data of each rank. The main 
example in [B] consists of data of ranks one through five. 



4. Rook monoid spectral analysis 

4.1. Preliminaries. As explained in Example 13.61 symmetric group spectral analysis begins by partitioning 
a dataset of partial rankings by rank before analyzing it — the output of the rank-Zc spectral analysis for voting 
data depends only on the rank-fc votes. For voting data, it might make sense to partition the dataset by 
rank before performing spectral analysis if one thinks that voters who vote with different ranks might vote 
differently. Indeed, this was the case in the main example in [^. 

For certain kinds of voting (or other partially ranked) data it might not make sense to partition the data 
by rank before analyzing it (in which case the full dataset can be averaged to create an element of CSn 
for analysis), or it might make sense to partition the data to an even finer degree before analyzing it — for 
instance, one might partition the data by rank, domain, and range before performing symmetric group 
spectral analysis on each part of the partition. 

To explain what we mean by symmetric group spectral analysis on such a set of partially ranked data, fix 
k < n and two subsets D and i? of {1, 2, . . . , ri} of size fc, and let R^'^ = {cr G _R„ : dom{a) ~ D, ran(CT) — R}. 
Suppose we wish to perform symmetric group spectral analysis on a function / : R^'^ C. Let PdiPr S i?„ 
be the unique order preserving bijections from {1, 2, . . . , fc} to and R, respectively. Identify Sk with the 
elements of _R„ whose domain and range are both {1, 2, . . . , fc}, view / as an element of CSk by 

and apply symmetric group spectral analysis (in CSk) to /. As an example, for n = 5, k = 3, D = {2, 4, 5}, 
and R — {1, 2, 5}, for the choice of a at the beginning of Section [2] we have 

_i / 1 2 3 4 5 \ 

PR ^PD=^^ ^ 2 _ _ j. 
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Effectively, tliis is because a sends tlie first element of its domain to the third element of its range, the second 
element of its domain to the first element of its range, and the third element of its domain to the second 
element of its range. 

We now proceed by extending the ideas of symmetric group spectral analysis to the rook monoid. In 
fact, for a function / : i?„ C, we define two different approaches to the rook monoid spectral analysis 
of /. (There are two different natural bases of Ci?„, and the differences in our approaches arise from the 
choice of which basis to associate with the delta functions of the elements of Under the groupoid basis 
association, defined in Section 14.21 we show in Theorem 14.41 that the rook monoid spectral analysis of / 
amounts to the idea for analysis above — that is, it amounts to a partitioning of / by rank, domain, and 
range, before performing symmetric group spectral analysis (using appropriately-sized symmetric groups) 
on each part of the partition. Under the semigroup basis association, we show in Theorem 14.61 that rook 
monoid spectral analysis offers a hierarchical approach to the analysis of / — in particular, it is the same as 
the rook monoid spectral analysis, under the groupoid basis association, of the function g : i?„ — > C given 

by 

tefl„:t>(T 

where t > a means that t extends ct as a partial function. 

To explain precisely how the algebra of Ci?„ leads to these methods of analysis we will need the following 
three things. First, we need an inner product under which the isotypic subspaces of Ci?„ are mutually 
orthogonal. Second, we need easily interpretable functions for partially ranked data. Finally, we need 
concrete descriptions of the isotypic subspaces of Ci?„ in terms of the natural statistical information each 
carries. We handle these three considerations in Section 14.31 with the help of the other natural basis of 
CRn — the groupoid basis — which we now review. 



4.2. The groupoid basis of the rook monoid algebra. The natural partial order on i?„ is defined in 
the following way: for s, t G say t < s if and only if s extends t as a partial function. The groupoid basis 
of CRn is the collection {[sJl^gij^, where 

[s\ = J2 (-i)'-'^(^)-^''(*)t. 

teB.„:t<s 

It is wen known [HISS] that (_i)ik(s)-rk(t) ^ ^(^^ ^j^gj.g 

fi is the Mobius function of <, so we can recover 
the semigroup basis of Ci?„ by inverting the Mobius function: 

E 

tG-R„:i<s 

The groupoid basis is a basis for Ci?„ , with multiplication given by the following formula ^2F : 

^ 1 lst\ if dom(s) = ran(i), 
1 otherwise. 

That is, the product [sj [ij is nonzero in <CR„ precisely when the domain of s lines up exactly with the range 
oft. 

There is a corresponding notion of a groupoid basis {[sjjsgg of <CS for any finite inverse semigroup S 
[25j , which we review in Appendix |B] 

Definition 4.1. Let 5 be a finite inverse semigroup and let / : — > C. Under the groupoid basis association, 
f corresponds to the element J2ses /(*)L*J £ 

It turns out that the groupoid basis of CS has a number of important implications for the representation 
theory of CS — for us, it will be instrumental in describing the isotypic subspaces of CS' and it will also yield 
an inner product under which the isotypic subspaces of CS are mutually orthogonal. We describe these 
implications for CRn in Section 14.31 and for CS in general in Appendix [C] 
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4.3. Isotypic subspaces, interpretable functions, and an orthogonal inner product. In this section 
we give an inner product under wlriclr tire isotypic subspaces of Ci?„ are mutually orthogonal, we describe 
easily interpretable functions for partially ranked data, and we give concrete descriptions of the isotypic 
subspaces of Ci?„ in terms of the natural statistical information they carry. 

We begin by noting that under the natural inner product on Ci?„ (obtained by declaring the semigroup 
basis of CRn orthonormal), the isotypic subspaces of Ci?„ are not mutually orthogonal in general. For a 
simple example, consider Ci?i — C-span(Id, A^) (where N denotes the null map). The irreducible represen- 
tations of Ci?i are both 1-dimensional. They are given by the linear extension of (x) = 1 for all x G Ri, 
and by the linear extension of 

pi(Id) = l, p\N)=0. 

CRi therefore splits into isotypics as CRi = ®V'^, where = C-span(A'^) and — C-span(Id — N). 
Under the natural inner product on Ci?„, we see that {N, Id — N) = — 1, so and V-^ are not mutually or- 
thogonal. This failure is caused by an "entanglement" between ranks that increases as n increases. Although 
an inner product under which the isotypic subspaces are mutually orthogonal is not strictly necessary for 
spectral analysis, it would give us nice mathematical properties (for instance, if /, g G CS and /, g denote 
the projections of / and g onto some isotypic subspace of CS*, then under such an inner product we would 
have {f,g) — {f,g) — (fig)), and it would aid in a sum-of-squares analysis as in ([2]). The groupoid basis 
effectively undoes the entanglement between ranks that causes this failure — declaring the groupoid basis 
orthonormal yields an inner product under which the isotypic subspaces of Ci?„ are mutually orthogonal. 



Theorem 4.2. Let (•, •} be the sesquilinear form on CRn induced by, for G Rn, 



1 if s = i, 
otherwise. 



Then, with respect to this inner product, the isotypic subspaces of Ci?„ are mutually orthogonal. 



Theorem 14.21 was proved in ^iTj . We extend it to finite inverse semigroups in general in Theorem IC. 31 
Next we describe rank-k easily interpretable functions (or just interpretable functions) for partially ranked 
data. Let k < n. The zeroth-order interpretable functions are the functions S^'^ : i?„ C, defined by 

!:D Rf \ J 1 dom((T) = D and ran(CT) = R, 
' (a) = < 

I otherwise, 

as D and R range across the size-fc subsets of {1,2,..., n}. If fc > 1 we also have the first-order interpretable 
functions S^J^j, defined by 

gD,R^^-^ ^ |l if dom(CT) = D, ran(CT) = R, and a{i) = j, 
1 otherwise, 

as D and R range over the size-fc subsets of {1,2,..., n}, i ranges over D, and j ranges over R. If fc > 2 
we also have second-order unordered and second-order ordered interpretable functions. The second-order 
unordered interpretable functions are the j^}' defined by 

D,R (a) - < dom((7) = D, mn{a) = R, and {a{ii),a{i2)} = {ji, ^2}, 

{'i^'2}^{nM^'^> - jo otherwise, 

as D and R range over the size-fc subsets of {1,2,..., n}, {ii, 12} ranges over the size-2 subsets of D, and 
{JI7J2} ranges over the size-2 subsets of R. The second-order ordered interpretable functions are defined 
similarly. If fc > 3 we also have third-order interpretable functions which are defined in an analogous fashion, 
and so on. 

Next we describe the isotypic subspaces of Ci?„. The isotypic subspaces of CRn are in bijection with the 
partitions of the integers {0, !,...,«} (which can be seen from Theorem IB. 5|) . so write 



A:=0 Xhk 
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where is the isotypic subspace of Ci?„ corresponding to the irreducible representation for the partition A. 
The irreducible representation corresponding to the partition A can be described by combining descriptions 
of the irreducible representations of the symmetric group with Theorem IB. 5i and the following descriptions 
of the arise by combining Diaconis's descriptions of the isotypic subspaces of CS'„ in Example 13.61 and 
[B] with a technical result in Appendix [Cl fTheorem lC.l|) . Let k < n. 
V^''^ is spanned by the elements 

(3) E '^'''''MW' 

as D and R range over the size-fc subsets of {1,2,..., n}. F^'^-' therefore carries zeroth-order information for 
rank-fc data. Notice that, for any fixed choice of D, R, the element in ([3]) is the function S^-^ viewed as an 
element of Ci?„ under the groupoid basis association. 

For purposes of the following descriptions, let us continue to view the interpretable functions as elements 
of Ci?„ under the groupoid basis association. For example, we have 

Every element of V^C'^i'i) is of the form 

D.RxD,R 

D,flC{l,2,. ..,«}: i,j 

\D\ = \R\ = k 

where for every choice of D and R we have V]- af = 0. yC^^i^i) therefore carries pure first-order 
information for rank-/c data. 

Similarly, every element of yC^^^^z) jg hnear combination of the j^} '^hich is orthogonal to 

the other isotypic subspaces, and so on. y('=^2,2) therefore carries pure second-order unordered information 
for rank-A: data, ^(^^"2,1,1) carries pure second-order ordered information for rank-fc data, and so on. 

4.4. Rook monoid spectral analysis under the groupoid basis association. Let / : Rn C. We 

now define rook monoid spectral analysis of / under the groupoid basis association, i.e., where we view 
/ G £Rn by 

We use the inner product on Ci?„ induced by declaring the groupoid basis mutually orthogonal. As in Section 
14.31 let us view easily interpretable functions as elements of Ci?„ under the groupoid basis association. 

Definition 4.3. Let / : i?„ -4 C. The statistics created by projecting X^o-s-R onto the isotypic 

subspaces of Ci?„ and computing the inner products of these projections with the appropriately-paired easily 
interpretable functions constitute the rook monoid spectral analysis of f under the groupoid basis association. 

By appropriately-paired, we mean that the easily interpretable functions 6^'^ with \D\ = \R\ = k are 
paired with the projection f^'^\ the interpretable functions S^^j with \D\ — = fc are paired with the 
projection f^''^^'^\ the interpretable functions ^{ifl^j^^iji 32} ^^^^ l-^l = \R\ = k are paired with the 
projection j('=-2,2)^ 

We now describe the statistics that result from this approach in terms of symmetric group spectral 
analysis. For every pair of size-fc subsets D and i? of {1, 2, ... , n}, denote by /^'■^ the restriction of / to 
{cr e i?„ : dom((T) = D, ran(cr) = R}. We may apply symmetric group spectral analysis (using CSk) to /^'^ 
in the manner described in Section [4.1l 

Theorem 4.4. The statistics generated by the rook monoid spectral analysis of f , under the groupoid 
association, are the same as the statistics generated by applying symmetric group spectral analysis using 
appropriately-sized symmetric groups, in the manner described in Section \4.1\ separately to each function in 
the collection 

n 

y {/^'« : Z?, i? C {1, 2, . . . , n], \D\ = \R\ = k}. 

k=0 
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By appropriately-sized symmetric groups, we simply mean that CSk is used for /^'^ when k = \D\ = \R 



Proof of Theorem\l4\ Let k < n and let D, R C {1,2, . . . ,n} with \D\ = \R\ = k. Let / = Eaefl„ fi^)W\- 
Let : i?„ C by 



For clarity, the only difference between F and / ' are their domains. View F as an element of Ci?„ 
under the groupoid basis association. Theorem IC.2I says that every non-zero isotypic projection of F^'^ 
in CRn can be written in terms of groupoid basis elements [crj for which dom(CT) = D and ran(CT) = R, 
and that the non-zero isotypic projections of F^'^ in Ci?„ are (after perhaps a relabeling of the domain 
and range) the same as the non-zero isotypic projections of /^'^ in CSk- The easily interpretable functions 
of domain D and range R in CRn were defined in such a way that their inner products with the isotypic 
projections of F^'^ in Ci?„ are the same as the inner products of the isotypic projections of f^'^ in CSk 
with the easily interpretable functions in CSk- Furthermore, it is immediate that the inner products of the 
isotypic projections of F^'^ in CRn with the other interpretable functions in CRn are zero. 

Next, if (? = J2aGR 9i'^)l'^\ ^ and g{a-) = whenever dom(cr) = D and ran((T) = R, Theorem I C. 2 1 
says that the isotypic projections of g in CRn, when written in terms of the groupoid basis, have nonzero 
coefficients only for groupoid basis elements [crJ for which dom(CT) ^ D or dom{a) ^ R. Therefore, for any 
such element g g Ci?„, the statistics generated by the inner products of the isotypic projections of F^'^ with 
the easily interpretable functions of domain D and range R in Ci?„ are the same as the statistics generated 
by the inner products of the isotypic projections of F^'^ -\-g with the easily interpretable functions of domain 
D and range R in CRn- In particular, for some such element g £ Ci?„ we have / = F^'^ + g, so the statistics 
that arise from the inner products of the isotypic projections of / S Ci?„ with the interpretable functions of 
domain D and range R in CRn are the same as the statistics that arise from the symmetric group spectral 



In an analogous fashion, Theorems IC. II and IC . 21 show that if S is any finite inverse semigroup, f : S ^ C, 
and we view / e CS using the groupoid basis association, then spectral analysis of / boils down to the 
spectral analysis of the components of / using the group algebras of the maximal subgroups of S- 

Theorem 14.41 shows that rook monoid spectral analysis under the groupoid basis association is different 
from symmetric group spectral analysis, in that it gives a more granular picture of the partially ranked data 
in a dataset. Next we give a direct comparison between the two approaches in the context of an example. 

4.5. An example. We now apply rook monoid spectral analysis under the groupoid basis association to 
a particular collection of partially ranked voting data. Our dataset for this example is the well-studied 
collection of votes from the 1980 American Psychological Association (APA) election, in which voters were 
asked to rank five candidates in order of preference. 15449 people voted, of which 5738 fully ranked all five 
candidates. The rank-3 votes are tallied in Tabled] [6, Table 6] and the votes of other ranks can be found in 
[5J Tables 1 and 6]. Each vote is a partial ranking. 

This dataset defines a C- valued (actually, a Z- valued) function / on i?5, where f{a) is the number of 
voters casting a ballot of type cr. The a here are written in standard list-form, with the top row removed. 
For example, looking at [6, Table 1] we have 




/((t) if dom(cr) = D and ran((T) = R, 
otherwise. 



analysis (using CSk) of / 



□ 




2 
3 



3 
1 



4 
4 




and from Table [T] [6l Table 6] we see that 
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Table 1. Rank- 3 ballots 





Vote 




Tally 


Vote 


Tally 


Vote 


Tally 


Vote 


Tally 




2 3 — 


— r- 


27 


1*^5 -'-5 


21 


38 


fl — — 2 31 


44 


f— 3 1 — 21 


16 




3 2 - 


_ 


79 


l''; 'J, 


- - 11 


35 


1 3 2 


35 


- 2 3 - 1 


14 


\2 


13- 




31 




- - 11 


41 


2 - - 1 3 


46 


- 3 2 — 1 


15 




1 2 - 


_ 


32 


fl - 




30 


2 3 1 


62 


- 1 - 2 3 


55 




3 1 - 




83 


fl - 


S 9 — 1 


21 


3 12 


90 


- 1 - 3 2 


45 


[3 


2 1- 

^, _L, 


_1 


57 


f2 - 


13-1 


39 


3 - - 2 1 


75 


_ 2 — 1 3 


54 


[1 


2, -,3 


„j 


19 


[3,- 


1,2,-] 


15 


[-,1,2,3, -] 


9 


[-,3, -, 1,2] 


62 


[1 


3,-, 2 




22 


[2,- 


3,1,-] 


15 


[-,1,3,2,-] 


17 


[-,2,-, 3, 1] 


50 


[2 


l,-,3 




31 


[3,- 


2,1,-] 


13 


[-,3,1,2,-] 


26 


[-,3, -,2,1] 


59 


[3 


l,-,2 




45 


[1,- 


3,-, 2] 


41 


[-,2,1,3, -] 


17 


[-,-,1,2,3] 


15 


[2 


3,-,l 




28 


[1,- 


2,-, 3] 


49 


[-,2,3,1,-] 


21 


[-,-,1,3,2] 


19 


[3 


2,-,l 




51 


[2,- 


l,-,3] 


74 


[-,3,2,1,-] 


18 


[-,-,2,1,3] 


16 


[1 


2,-,- 


,3] 


26 


[3,- 


l,-,2] 


47 


[-,1,2,-, 3] 


8 


[-,-,3,1,2] 


46 


[1 


3,-,- 


,2] 


31 


[2,- 


3,-,l] 


37 


[-,1,3,-, 2] 


15 


[-,-,2,3, 1] 


17 


[2 


1,-,- 


>3] 


17 


[3,- 


2,-,l] 


32 


[-,2,1,-, 3] 


16 


[-,-,3,2,1] 


20 



We have the isotypic decomposition 

Ci?5 = (y^''^ © V^'^'^ ® V^^'^'^ © ® y(2,2,l) ^ y(2,l,l,l) ^ ^(1,1,1,1,1) j 

© © y(3,l) ^ ^(2,2) ^ ^(2,1,1) y(l,l,l,l)) 

© e y(2,l) ^ ^(1,1,1)) ^ ^yi2) ^ ^(14)^ 05 (^^(1)^ ^ (^^(0)^ ^ 

where T^''^ is the isotypic subspace of Ci?5 corresponding to the irreducible representation for A. View / as 
an element of Ci?5 under the groupoid basis association. 

We begin our analysis by projecting / onto the isotypic subspaces, that is, by writing 

fc=0 Ahfe 

for unique elements G V^. We use the inner product induced by declaring the groupoid basis elements 
of Ci?„ mutually orthonormal, so that the are mutually orthogonal. 

Under the groupoid basis association, the rank-fc data projects onto the where A 1- fc, and we may 
therefore carry out our analysis rank by rank. According to Theorem 14.41 our rank-5 analysis is exactly 
the same as that provided by symmetric group spectral analysis applied to the rank-5 votes. The results 
from that analysis may be found in 'F| . It is in the partially ranked data that rook monoid spectral analysis 
differs. 

The projections for A h 4 are all zero, as f{a) = for all a ^ such that rk((T) = 4. After all, 
ranking n — 1 out of n candidates naturally ranks the nth as well. 

Next we consider the projections f'^ for A 1- 3. Recall from Section l473l that 1/^^^ is the sum of the spaces 
of constant functions for each of the rank-3 choices of domain and range. That is, V^^'^ is spanned by the 
elements 

as D and R range across all size-3 subsets of {1,2,3,4,5}. We take the projection Z*^^^ and compute the 
inner products of it with these 6^'^ to obtain Tabled] Notice that the D,R entry is simply the number of 
rank-3 voters ranking the candidates in D in the positions in R. 

More interesting is f^'^'^\ which in this case contains both the pure first-order and second-order unordered 
information. To explain, we have the easily interpretable first-order rank-3 functions 

S^h^ = E 
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Table 2. Zeroth-order groupoid analysis, rank-3 data 





Range 


Domain 


1,2,3 


1,2,4 


1,2,5 


3,4,5 


L,Z,6 


ouy 


u 


u 


u 


1 9 4- 


196 











1,2,5 


188 











1,3,4 


133 











1,3,5 


280 











1,4,5 


352 











2,3,4 


108 











2,3,5 


84 











2,4,5 


325 











3,4,5 


133 












where D, R are size-3 subsets of {1, 2, 3, 4, 5}, i £ D, and j G R. Every element of V'^^'^^ is of the form 

\ ' \ ' D,RcD,R 
D,R ij 

where, for every choice of D,R, X^ij^fj^ = 0- When ranking three candidates, choosing a domain, a 
range, and the ranking of one of the candidates automatically defines the unordered set of rankings for 
the other two candidates. Thus, for the analogous second-order unordered rank-3 functions, we have (for 
{ii,i2} C D,{j„j2} C R), 

cD,R _ i-D.R. 

"{n,i2}^{jij2} ~ °D\{iui2}>-^R\{ju32}- 

^(2^1) therefore carries pure second-order unordered statistics as well. 

Inner products of Z^^'"'^' with the S^^' are given in Table [3l Entries in these tables have been rounded 
to two decimal places. By the comment above, the inner products of /'^'^■' with the ja} -i^^^ 

permutations of the entries in Table [3l For example, the inner products of Z^^'^-* with ^ire 
given in Tabled 

If we denote the rank-3 portion of / by /a , 

CTGfl„:rk((T)=3 

then we have ll/'"^-* -|- /'■^'"'^^ll > -QOBll/all, so we discard the projection /(^'^^i) from our analysis. 

For comparison, the results of symmetric group spectral analysis, as applied to the rank-3 portion of /, 
are given in Table [S] [S', Table 9] . 

Through the examination of Tables [3] through [SJ we see that rook monoid spectral analysis under the 
groupoid basis association offers a more local, granular inspection of the data than the symmetric group 
spectral analysis of Example l3.6l does. in that it allows us to see how the natural subsets of the rank-/c voters 
vote amongst themselves. 

Positive numbers in these tables indicate a positive (larger-than-average) effect for choosing a candidate 
(or group of candidates) in a position (or group of positions), with higher values indicating stronger effects. A 
constant function would have all entries in these tables equal to (as V^^^^ is orthogonal to V^'^'^^). Negative 
values in these tables indicate lower-than-average effects, with larger-magnitude negative numbers indicating 
stronger negative effects. For example, the entry of —38.33 in the ((4, 5), (2, 3)) position of Table |4] indicates 
that voters who chose to rank candidates 1, 4, and 5 (in positions 1, 2, and 3) ranked candidates 4 and 5 in 
positions 2 and 3 (without regard to order) considerably less often than would occur in a uniform spread of 
votes. A quick glance at the votes in Table [T] reveals that the partial rankings [1, — , — , 2, 3] and [1, — , — , 3, 2] 
were indeed the two least-popular choices among the rank-3 voters ranking candidates 1, 4, and 5. 

Before we proceed with an examination of the numbers in Table[31 we recall the main pattern that Diaconis 
found in the overall dataset [6| . In the rank-5 data, Diaconis found large second-order unordered pair effects 
for ranking candidates 1 and 3 and candidates 4 and 5 in positions 1 and 2 and positions 4 and 5. He found 
similar patterns in the lower-order ranks in the dataset. The short story of this election was that candidates 
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Table 3. First-order groupoid analysis, rank-3 data 



D -- 


= {l,2,3},i? = 


{1,2,3} 








Rank 


Candidate 


1 


2 


3 


4 


5 


1 


3 


11 


-14 








2 


-40 


-19 


59 








3 


37 


8 


-45 








4 

















5 

















D = 


= {l,2,5},i? = 


{1,2,3} 








Rank 


Candidate 


1 


2 


3 


4 


5 


1 


-5.67 


-10.67 16.33 








2 


-7.67 


4.33 


3.33 








3 

















4 

















5 


13.33 


6.33 


-19.67 








D = 


= {l,3,5},i? = 


{1,2,3} 








Rank 


Candidate 


1 


2 


3 


4 


5 


1 


-3.33 


17.67 


-14.33 








2 

















3 


27.67 


-12.33 


-15.33 








4 

















5 


-24.33 


-5.33 


29.67 








D = 


= {2,3,4},i? = 


{1,2,3} 








Rank 


Candidate 


1 


2 


3 


4 


5 


1 

















2 


-10 


2 


8 








3 


7 


-9 


2 








4 


3 


7 


-10 








5 

















D = 


= {2,4,5},i? = 


{1,2,3} 








Rank 


Candidate 


1 


2 


3 


4 


5 


1 

















2 


-8.33 


-4.33 


12.67 








3 

















4 


7.67 


5.67 


-13.33 








5 


0.67 


-1.33 


0.67 











D -- 


= {l,2,4},i? = 


{1,2,3} 








Rank 


Candidate 


1 


2 


3 


4 


5 


1 




-24.33 


-6.33 


30.67 








2 




10.67 


4.67 


-15.33 








3 



















4 




13.67 


1.67 


-15.33 








5 





















D -- 


= {l,3,4},i? = 


{1,2,3} 








Rank 


Candidate 


1 


2 


3 


4 


5 


1 




6.67 


9.67 


-16.33 








2 



















3 




9.67 


-1.33 


-8.33 








4 




-16.33 


-8.33 


24.67 








5 





















D -- 


= {l,4,5},i? = 


{1,2,3} 








Rank 


Candidate 


1 


2 


3 


4 


5 


1 




-38.33 


-9.33 


47.67 








2 



















3 



















4 




18.67 


1.67 


-20.33 








5 




19.67 


7.67 


-27.33 










D -- 


= {2, 3, 5}, 7? = 


{1,2,3} 








Rank 


Candidate 


1 


2 


3 


4 


5 


1 



















2 




-5 


2 


3 








3 




4 


-5 


1 








4 



















5 




1 


3 


-4 










D -- 


= {3,4,5},i? = 


{1,2,3} 








Rank 


Candidate 


1 


2 


3 


4 


5 


1 



















2 



















3 




-10.33 


-11.32 


21.67 








4 




17.67 


-9.33 


-8.33 








5 




-7.33 


20.67 


-13.33 









Table 4. Second-order unordered groupoid analysis, votes with domain {1,4,5} and range {1,2,3} 





Rank 


Candidates 


1,2 1,3 2,3 


1,4 
1,5 
4,5 


-27.33 7.67 19.67 
-20.33 1.67 18.67 
47.67 -9.33 -38.33 



1 and 3 were on one side, candidates 4 and 5 on the other, and candidate 2 was somewhere in the middle, a 
bit closer to candidates 4 and 5. Voters primarily tended to support one of these sets of candidates, either 
{1, 3} or {4, 5}, and then chose between them. This is supported by the second-order data in Table [5] The 
first-order information in Table [5] also shows that among all rank-3 voters, candidates 4 and 5 were preferred 
overall. Note that the information in Table [5] accounts for the fact that in this election, for rank-3 voters. 
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Table 5. Diaconis's first-order and second-order unordered analysis, rank-3 data 



15 





Rank 


Candidate 


1 


2 


3 


1 


2 


76 


114 


2 


-78 


-28 


52 


3 


2 


-103 


-116 


4 


38 


-7 


-48 


5 


35 


63 


-1 





Rank 


TTj 

OanQiQates 




1,3 2,3 


1,2 


-oU 


D 12 


1,^ 


150 


-3 -41 


1,4 


-71 


-8 11 


1,5 


-28 


5 16 


2,3 


-2 


24 28 


2,4 


57 


-5 -7 


2,5 


-5 


-24 -34 


3,4 


-84 


-12 -4 


3,5 


-63 


-8 17 


4,5 


97 


26 



unranked candidates are implicitly ranked in one of the last two positions — this is automatically accounted 
for in the creation of this table because Diaconis's technique for symmetric group spectral analysis begins 
by averaging over missing data for partially ranked data. 

Table |3] allows us to proceed with a more local, granular examination of the rank-3 data. Examining these 
numbers we see that, among the rank-3 voters, there is a consistent positive effect for ranking the candidate 
pairs (1, 3) and (4, 5) in positions 1 and 2 (without regard to order). We also see a positive first-order 
effect — and often a strong one — for ranking candidate 3 in position 1 whenever candidate 3 is ranked, except 
when the other two candidates ranked are candidates 4 o-i^-d 5. These observations are consistent with the 
overall patterns in the dataset and further support the idea that candidates 1 and 3 were on one side of the 
election and candidates 4 and 5 were on the other side. We can also examine subsets of these tables together 
to glean further insights — for example, the numbers in Table |3] show that among the rank-3 voters ranking 
both candidates 1 and 3, candidate 3 was heavily preferred. This information is not readily apparent from 
Tabled 

An examination of the projections /'*' for A = 2, 1,0 would proceed in a similar fashion, and is omitted. 

4.6. Rook monoid spectral analysis under the semigroup basis association. Let / : i?„ — ?> C. We 

now define rook monoid spectral analysis under the semigroup basis association, i.e., where we view / as an 
element of Ci?„ by 

f=Y^ f{a)a. 

Note that /, when expressed with respect to the groupoid basis, is 
where 

t>cr 

We continue to use the inner product on Ci?„ induced by declaring the groupoid basis mutually orthonor- 
mal. We will remark on the alternative inner product (induced by declaring the semigroup basis mutually 
orthonormal) in Section [4.71 

As we did in Section [4. 4| to analyze / we project / onto the isotypic subspaces of Ci?„ and compute inner 
products with the appropriate easily interpretable functions. In general, let E be an easily interpretable 
function for rank-fc data. Then E : i?„ {0, 1} and E{a-) = if rk(CT) k. Let Eq and Es denote E viewed 
as an element of Ci?„ under the groupoid basis association and the semigroup basis association, respectively. 
We claim that we get the same numbers regardless of whether we compute inner products of the projections 
of / with Eg or Eg- To see this, let A h fc. We have 

Eg^ E ^(^)W' e ^('^)'^- 

(TGfl„:rk((T)=fc (TG-R„:rk((T)=fc 
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Now, Es, when expressed with respect to the [crj basis, is of the form 

(Te-R„:rk(CT)=fe cre_R„:rk((T)<fc 

for some fmiction z on Also, the projection when expressed in terms of the groupoid basis, contains 
nonzero coefficients only for elements [crJ for which rk(cr) = k. Therefore, l^f^^Ec^ = (^f^.Es). 

Definition 4.5. Let / : _R„ C. The statistics created by projecting X^o-e-R fi'^)'^ onto the isotypic 
subspaces of Ci?„ and computing the inner products of these projections with the appropriately-paired 
easily interpretable functions constitute the rook monoid spectral analysis of f under the semigroup basis 
association. 

By the discussion above, we have: 

Theorem 4.6. Let f : Rn — >■ C. Then the statistics generated by the rook monoid spectral analysis of f 
under the semigroup basis association are the same as the statistics generated by the rook monoid spectral 
analysis, under the groupoid basis association, of the function g : Rn C, where 

Theorem 14.61 shows that rook monoid spectral analysis, under the semigroup basis association, offers a 
hierarchical approach to the spectral analysis of partially ranked data, where the rank-fc analysis is derived 
not just from the data of rank k, but instead from the data of rank k and higher. 

4.7. Rook monoid spectral analysis under the natural inner product. Let / : i?„ — C. In this 
section we give a couple of remarks about what happens if we try to apply rook monoid spectral analysis 
to / under the natural inner product on Ci?„, induced by declaring the semigroup basis of Ci?„ mutually 
orthonormal. As we saw in Section 14. 3[ under this inner product the isotypic subspaces of Ci?„ are not 
mutually orthogonal in general. This can interfere with a sum-of-squares analysis of the lengths of the 
projections of /, but as we are about to see, if we are careful then it will create the same easily-understood 
statistics as in Sections 14.41 and 14.61 

In particular, let E be an easily interpretable function for rank-A; data. Then E : Rn — > {0,1} and 
E{a) = if rk((T) ^ k. Let (•, •) denote the inner product on Ci?„ induced by declaring the groupoid basis 
mutually orthonormal, and let (•,-)^ denote the natural inner product on Ci?„. View / as an element of 
CRn under the semigroup basis association or the groupoid basis association, and express / with respect to 
the groupoid basis. So, if we are using the groupoid basis association, then 

/ = E 

and if we are using the semigroup basis association, then 

where g{a) = J2t>a fi^)- A be a partition of k and consider the projection If Eq and Es denote 
E viewed as an element of CRn under the groupoid basis association and semigroup basis association, 
respectively, we saw in Section l46l that (/'*', iJ^) = {f^:Es). A similar argument shows that {f^,Es)^ = 
Eq) , so we have 

{f\Es)^ = {f\EG) = {f\Es). 

In this way we can use the natural inner product to perform rook monoid spectral analysis. 

However, in general (/^, Eg) ^ will be different due to interference from terms of rank lower than k in the 
inner product, as will inner products such as (^f,Eg) and (^f'^,Eg) . Here is a simple example. Consider 
/ e Ci?2 by 

-)"(^ -)■ 
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In the notation of Scction|131 let E = (J^iJ-^^l, so Es = l{ \ ^ ]. Also 



2 



Eg = 1 



1 2 

2 - 



Denote by and E]^ the projections of / and Es onto T^'^-* (the only isotypic of Ci?2 of dimension greater 
than 1). We have 

if _ 2 +n 2 - +n - 1 



and 



First, note that (/^. £'g) = 9, which tells us that there are 9 partial rankings in / which map 1 to 2. We also 
have (/, i?^.)^ = 4, which measures how many more rank-1 partial rankings in / map 1 to 2 than there are 
null rankings in /. Also, {f\El;)^ = 32 = {f.Ec)^, which measures the total number of partial rankings 
in / which map 1 to 2 plus the total number of partial rankings in /. In either case we can extract useful 
statistics (such as the number of rank-1 partial rankings in / which map 1 to 2, or the total number of partial 
rankings in / which map 1 to 2) by performing the appropriate additions and subtractions. However, such 
desirable statistics are already available to us, with no additional effort required, if we simply use the inner 
product (•, •) instead of the natural one. We obtain other benefits by using (•, •) as well, such as orthogonality 
of isotypic subspaces. 



5. Concluding remarks and open questions 

As we have seen, if S' is a finite inverse semigroup, then via the groupoid basis of CS", spectral analysis 
based on S essentially boils down to group-based spectral analysis using the maximal subgroups of S. 

The essential algebraic component that enables inverse semigroup spectral analysis seems to be the 
semisimplicity of CS*, which allows us to write any element / of any CS'-module uniquely as the sum of 
its isotypic projections. For most finite semigroups 5, CS" is not semisimple. An intriguing example is 
S ~ Tn, the full transformation semigroup on n elements. The question of what spectral analysis based 
on S should mean in such a case remains an open question. In the simplest case, if / G CS where CS is 
not semisimple, then the collection p(/), as p varies over a complete set of inequivalent, irreducible rep- 
resentations of CS", does not uniquely determine /. What kinds of information do we lose if we only use 
the irreducible representations of CS" and perform calculations with inner products as in this article? This 
is really a question about the Jacobson radical of CS* and how it interacts with "interpretable" functions 
based on S. If the irreducible representations are not enough to capture what statistics we want about a 
function, should we consider the indecomposable representations instead? And how should we go about 
understanding the spectral analysis of arbitrary functions (i.e., elements of arbitrary CS'-modules) if S has 
infinite representation type, as is the case for S = Tn for n > 4 '20^? Furthermore, if S is an arbitrary finite 
semigroup, is there a "correct" notion of an inner product on CS in general, analogous to the inner product 
obtained by declaring the groupoid basis of <CS orthonormal when S is an inverse semigroup? As we have 
seen, sometimes the natural inner product is not the most useful one. 
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Appendix A. Basic representation theory for inverse semigroups 

In this appendix we review the basic definitions from the representation theory of inverse semigroups. 
These ideas carry over with little or no modification to general semigroups. For a treatment of the represen- 
tation theory of semigroups in general, see [19]. Let S' be a finite inverse semigroup. 
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Definition A.l. The complex algebra of S, denoted CS, is (as a vector space) the C-span of the symbols 
s € S. The muhiphcation in CS, cahed convolution and denoted by *, is defined by the hnear extension of 
the multiphcation in S by the distributive law. 

If is a group, convolution may be written in the familiar way: If /, <? G CS* with / = ^^'^ 
9 = Eses5(s)s, then 

/ * .9 = fi.'r)9{'r~^s)s. 

If S has an identity I5, then CS has a multiplicative identity, namely 1 • I5. Even if S does not have 
an identity, CS does. We can see this from the semisimplicity of CS' [18] and the Wedderburn isomorphism 
(Theorem 13.21) — the inverse image of the identity of the algebra on the right of ([T]) is the identity of CS. 
Denote the identity of CS by Ics and the algebra of n x n matrices over C by M„ (C) . 

Definition A. 2. A matrix representation (or just representation) p of CS* of dimension dp G N is a linear 
map p : CS — > Mdp{C) for which p{ab) ~ p(a)p{b) for all a,b ^ CS, and for which p{lcs) is the identity 
matrix. 

Equivalently, a representation of CS is a finite-dimensional C-vector space which is also a unital left 
CS-module. In this paper we only consider finite-dimensional representations and unital modules, so left 
CS-modules and representations of CS are the same. 

Definition A. 3. Matrix representations pi,p2 of CS* are equivalent if there is an invertible matrix A such 
that 

Api{x)A-^ ^P2{x) 

for aU X e CS. 

That is, two representations arc equivalent if they are isomorphic as left CS-modules. 

Definition A. 4. A representation p of CS" is irreducible if it is simple as a left CS-module. 

Equivalently, p is irreducible if there do not exist representations pi,p2, a matrix valued function g, and 
an invertible matrix A for which 

" Pi{x) 
g{x) p2{x) 



Ap{x)A-^^ = 
for aU X e CS". 



Definition A. 5. CS is said to be semisimple if every left CS-module is equal to a direct sum of simple left 
CS-modules. 



Appendix B. The groupoid basis of a finite inverse semigroup 

Let iS be a finite inverse semigroup. There is a natural partial order on S given by, ioi s,t £ S , t < s 
if and only iit = es for some idempotent e € S [16]. Notice that if S" is a group, then this partial order is 
trivial in the sense that t < s if and only if t = s. Recently B. Steinberg has used the Mobius function of this 
partial order to realize the decomposition of CS into a direct sum of matrix algebras over group algebras 
[5S]. To see how this works, we begin by reviewing the groupoid basis of CS [15]. 

Definition B.l. The groupoid basis of CS is given by the collection {[sjjsgs, where 

teS:t<s 

and p is the Mobius function of the natural partial order on S. 

For x,y € Rn, x < y if and only if y extends a; as a partial function. We can recover the semigroup basis 
of CS in terms of the groupoid basis by inverting the Mobius function: 

E w- 

teS:t<s 

The groupoid basis is a basis for CS, whose multiplication is given by the following formula [25) : 
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(4) WW 



if s-^s = tr 
otherwise. 



For s £ Rn, s~^s is the partial identity on dom(s) and ss~^ is the partial identity on ran(s) (keeping 
in mind that we view maps as acting on the left of sets and that we compose maps from right to left). It 
follows that for s,t e 

\_st\ if dom(s) — ran(t), 
otherwise 



WW 



in Ci?„. 

We will also need Green's X»-relation [iligil^: 

Definition B.2. Let e, / e S* be idempotent. We say e and / are isomorphic if there is an element s e S" 
such that e = s^^s and / — ss^^. Idempotents e and / are said to be V-related if they are isomorphic. In 
general, elements s,t G S are said to be D-related if s~^s is isomorphic to t~^t. 

The equivalence classes of S under the P-relation are the P-classes of S. An equivalent characterization 
of V is that s and t are D-related if and only if s and t generate the same two-sided ideal in S. For _R„, the 
idempotents are the restrictions of the identity map, and two idempotents are isomorphic if and only if they 
have the same rank. i?„ has n + 1 D-classes. They are Dq, Di, . . . , D„, where D/. is the set of elements of 
Rn of rank k. 

Definition B.3. A subgroup of 5* is a subset of S which is also a group. A subgroup G of S" is maximal if 
G is not contained in any other subgroup of S. 

Given an idempotent e of S, there is precisely one maximal subgroup of S containing e called the 
maximal subgroup of S at e and denoted Ge- In fact [25] 

Ge = {s & S : s~^s = ss~^ = e}, 

and e is the identity of Ge- If e and / are isomorphic idempotents, it is straightforward to show that Ge = Gf. 
For Rn, the maximal subgroup at any idempotent e of rank k is isomorphic to Sk- 

We can now describe Steinberg's decomposition of CS into a direct sum of matrix algebras over group 
algebras. Let Do,-..,Dn be the D-classes of S. Let CDk be the C-span of {[sj : s E Dk}. From ([4]) it 
follows that CS' ~ ®fe=o '^■^k- The following theorem can be found in [25] . 

Tiieorem B.4. Let rk indicate the number of idempotents in Dk, and let Ck be any idempotent in Dk. Denote 
the maximal subgroup of S at Ck by Gk- Then there is an algebra isomorphism (f> : CDk — )• Mr^{CGk)- 



The isomorphism (j) that Steinberg constructs to prove Theorem lB.4l is given explicitly as follows. For each 
I?-class Dk, fix an idempotent e^. For every idempotent a G Dk, fix an clement pa G S such that Pa^Pa — Gk 
and PaPa^ — ctj taking pe^. — Ck- It is straightforward to show and important to note that pa G Dk (and 
hence ^ G Dk as well). View the rk x rk matrices as being indexed by pairs of idempotents in Dk- Define 
on the basis { [sJ : s G Dk} of CDfc in the following manner: for an element [sJ G CDk with s^^s — e and 
ss-'^f, 

(t^{[s\) ^ Pf^^SPeEf^e, 

where Ef^e is the standard x matrix with a 1 in the /, e position and elsewhere. We have that 
Pf~^spe G Gk by construction, the linear extension of cf) to CDk is the isomorphism, and the inverse of (f) is 
induced by, for s G Gk, 

sEf^e lpfSPe~^\- 

Thus CS^^l^^Mr.iCGk)- 

This gives us the following powerful method for constructing the irreducible representations of CS" from 
the irreducible representations of the maximal subgroups of S |25j . 

Theorem B.5. Let Dq, Di, . - . , Dn be the V-classes of S. For each k G {0,1,..., n}, fix an idempotent 
Gfc G Dk- Let Gk be the maximal subgroup of S at Ck, and let lRR{Gk) be any complete set of inequivalent, 
irreducible matrix representations for Gk- Then the irreducible representations of CS are in one-to-one 
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correspondence with the elements of ^'^^Q^RRiGk)- Specifically, given an irreducible representation p of 
CGk, form the irreducible representation p of Mr^{CGk): 

for g e Gk, where Eij is the standard rk x rk matrix with a 1 in the i,j position and elsewhere. Extend 
p linearly to the rest of Mr^.{CGk), and further extend p to CS by letting it be on the other summands of 
^^^Q Mrf. (CGk)- As p ranges over W5!^QlRR(G'fc), the p form a complete set inequivalent, irreducible matrix 
representations of CS . 

We will use this theorem to help us describe the isotypic subspaces of CS in terms of the isotypic subspaces 
of the complex algebras of the maximal subgroups of S in Appendix [Cj First, however, we explain what the 
isomorphism (j) from Theorem IB. 41 translates into when S = Rn- 

For a 2?-class Dk of i?„ (that is, the subset of elements of i?„ of rank fc), let us take e Dk to be the 
partial identity on {1, . . . , fc}, that is, 

/ 1 2 ••• k k + 1 ■■■ n \ 
""^^[l 2 ... fc - ... - )■ 

We then have 

Gk = {se Rn : dom(s) = ran(s) = {1,2,..., k}}. 
We identify G^ with the permutation group Sk in the obvious manner. 

For an idempotent a S Dk (that is, a rank-fc restriction of the identity map), let us take Pa to be the 
unique order-preserving bijection from {1, 2, . . . , fc} to dom(a) — ran(a). For an element s G Rn of rank k, 
define the permutation type of s, perm(s), to be, informally, the "arrows" from dom(s) to ran(s), expressed 
as a permutation in Gk — Sk. For example, if 

^ = ( 4 ^ 1 2 ) ' " ( 3 1 2 ) 

because s sends the first element of its domain to the third element of its range, the second element of its 
domain to the first element of its range, and the third clement of its domain to the second element of its 
range. 

Formally, we define 

perm(s) ^ Pss-^'^sp^-i^, 

where Ps~^s is the unique order preserving bijection from {1, 2, . . . , fc} to dom(s) and Pss-i~^ is the unique 
order preserving bijection from ran(s) to {1, 2, . . . , fc}. 

The isomorphism (p from Theorem IB. 41 now works as follows. We have (^) x (^) matrices, so let us index 
their rows and columns by the fc-subsets of {1, 2, . . . , n}. We have 

CDk^M(^,.-^{CSk) 

where, if s e _R„ has rank fc, then '/'([sj) — perm(s)i?i.a,i(s).doin(s). This result was implicit in Munn's work 
on the rook monoid |17| . and was first written down explicitly by Solomon [23) . Solomon's isomorphism is 
essentially the same as the one we just described. As a corollary, we have: 

Corollary B.6. Ci?„ ^ ©Lo M(^) (C5fc). 

Appendix C. Isotypic subspaces and an orthogonal inner product 

Let S' be a finite inverse semigroup. In this appendix we use the results of Appendix[B]to describe Fourier 
bases of CS* and isotypic projections in CS* in terms of those of the CG, as G ranges over the maximal 
subgroups of S, and we give an inner product on CS* under which the isotypic subspaces of CS are mutually 
orthogonal. 

As in Appendix IB| let Do, ... , Z?„ be the P-classes of S, let denote the number of idempotents in Dk, 
pick an idempotent Ck in each 2?-class Dk, and let Gk be the maximal subgroup of S at e^. Then by Theorem 
IB. 51 the isotypic subspaces of CS* are in one-to-one correspondence with the isotypic subspaces of the CGk , 
as fc ranges from to n. Also, as in the isomorphism from Theorem lB.4l for every idempotent a G Dk, fix an 
element pa ^ S such that Pa~^Pa = and PaPa^^ — cl (and take p^f. — e^). Let IRR(Gfc) be a complete set 
of inequivalent, irreducible matrix representations of CGk- For each p e IRR(Gfc), let p denote its extension 



INVERSE SEMIGROUP SPECTRAL ANALYSIS FOR PARTIALLY RANKED DATA 



21 



(as in TheoremEl]) to 0)!^o Mr^^CGk)-, and hence to CS. hei y = {p : p e K=QS(R{Gk)}, so that 3^ is a 
complete set of inequivalent, irreducible matrix representations of C5. 

We begin to describe the isotypic subspaces of CS* by describing the Fourier basis for CS* according to 
3^ in terms of Fourier bases of the CGfei If i? C <CS is the set of inverse images of the natural basis of 
0^g-y Mrf-(C) in the Wedderburn isomorphism 

(5) 0p:C5^0Mrf,(C), 

pey pey 

then for each y E B, 

ses 

We will describe the coefhcients y{s). 

Suppose we already have an explicit description of a Fourier basis for CGfc for each k G {0, . . . , n}. That 
is, if C is the set of inverse images of the natural basis of the algebra on the right in the isomorphism 

(6) p:CG,^ Mrf^(C), 

pelRR(Gfc) pelRR(Gfc) 

then, for each c G C, 

c = c{x)x. 

xeGk 

We will describe the coefficients y{s) in terms of the c{x). 
Let p e IRR(Gfc), and let c,,j G CG^, 



y ^ Ci.j(a;)a;, 



xeGk 

be the inverse image in the isomorphism ^ of the element of -^'^dp (C) that is 1 in the i,j 

position in the p block and elsewhere, p maps to block matrices whose rows and columns are indexed by 
the idempotents in Dk, and whose entries are themselves dp x dp matrices. We have the following description 
of a Fourier basis for C5, which generalizes the description for Ci?„ given in [TT| . 

Theorem C.l. Let X be a dp x dp matrix with a 1 in the i,j position and elsewhere. For idempotents 
a,b G Dk, let Eb^a be an r^ x matrix with a 1 in the b,a position and elsewhere. The inverse image in 
the isomorphism ([5]) of the element of ^^^y Mdp{C) that is E^^a ® X in the p block and elsewhere is 



[Pb\ Y ) ba ^J- 

\xeGk J 



Proof. Suppose 1 ^y, 1 ^ p, and 7 e IRR(Gfc). Then 

7 

(where indicates the zero matrix), because 



\xeGk J J 



7 



\xeGk / \xeGk / 



Eek,ek®^ 



= 0. 

Suppose now that 7^3^ and 7 G IRR(Gj) with j ^ k. Then 



7 



M ( E ba-^j) =0 

\xeGk J J 
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because 



\xeGk I xeGk 



0. 



Finally, 



P ((PPa-Va)""^Pa^"^PPaPa-l'^Pa-lpa:PaPa-0 

--p{ekEb^e,) {Ee^,e, ® X) p {ekEe^^a) 

-- {Eb,e, ® Id,) {Ee^,e, ® X) {E^.^a ® Id,) 

~-Eb,a ® X. 



□ 



Note that the Fourier basis element of CS in Theorem I C.l[ when expressed in terms of the groupoid basis, 
has nonzero coefficients only for elements [sj for which s e Dk, as pb G D^, Pa^ & Dk, and x £ for all 

For Ci?„, given a Fourier basis B = {}2,„^g^ hi{a)a}\Ji for an isotypic subspace V for <CSk, where A is a 
partition of A:, we obtain a basis for the corresponding isotypic subspace of Ci?„ by forming the products 



as a and 6 range over the size-fc subsets of {1, 2, . . . , n} (where pb is the unique order-preserving bijection 
from {1, 2, . . . , fc} to 6 and is the unique order-preserving bijection from o to {1, 2, . . . , fc}) and the term 
in the middle ranges over the elements of B. 

We can now describe the isotypic projections of an element / G CS* in terms of isotypic projections in the 

Theorem C.2. Let a,b E be idempotent. Let f G CS have the form 

where /(s) = unless s G Dk, s~^s = a, and ss^^ = b. Let fa be f viewed as an element ofCGk, i-e., 

fG= E fis)Pb^spa. 

s^Df^-.ss^^ —b,s^^ s—a 

Let IRR(Gfc) — {pi, . . . , pq}. For z G {1, . . . , q}, denote the isotypic subspace of CGk corresponding to pi by 
Wi and the isotypic subspace of CS corresponding to pi by Vi. Let fQ,...,fQ denote the projections of fa 
onto Wi , . . . , Wq , and suppose 

/g = E •'^(f)^- 
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Then 

i=l 

[PblfclPa^l projection of f onto Vi, and 

In general, if / G CS*, then / may be written as a sum of elements of CS* of the form in the hypothesis of 
this theorem. We can then understand the projections of / in terms of the projections of these elements. 

Proof of Theorem I C. 2\ By the isomorphism in Theorem IB. 41 we can write 

/ = E f(Pb9Pa')[Pb9p-'\ = bbj E f^b9p-^)\-9\ ba'j 

geGfc \geGk ) 

and 

/g E f(Pb9Pa^)9- 
Proposition 4.3 of f25| states that, for s,t € S, 



(7) [s\t 
From this we see that 



[st] ifs-^s<tt^^, 
otherwise. 



b;.J/Gba'J, 

and since 

/g = + • • • + /S, 

we have 

/ = bbj fh [Pa'\ + [Pb\ fa VPa'\ + ■ • ■ + b^-J fh \Pa'\ ■ 

Next, since 

/g = E ^^(9)9, 

applying to bbJ/cba^J yields that 

b^>J/Gba'J = b;>J (E^^C9)bJ| ba'J- 
\96Gfc / 

Finally, if we choose a Fourier basis of CGfc and write the in terms of this Fourier basis, equation ([7|) and 
Theorem lC.il say that for all i, b&J/cba^J ^ That is, b&J/cba^J the projection of / onto Vi, as 
claimed. □ 

Finally, we give an inner product on CS under which its isotypic subspaces are mutually orthogonal. 
Theorem C.3. Let (•,•) be the sesquilinear form on CS induced by, for s,t S, 



1 if s = 
otherwise. 



(bJ,W) 

Then, with respect to this inner product, the isotypic subspaces of <CS are mutually orthogonal. 
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Proof. By linearity, it suffices to show that (w, -y') = in the case that v and v' are Fourier basis elements of 
CS in distinct isotypic subspaces. We may assume that v and v' are part of a Fourier basis for CS according 
to y. Let p, p' e y. Let G and v' € Vpi , with Vp 7^ Vp' (and hence p p'). 
We know that 

CGfe = l^p 

pelRR(Gfc) 

where Wp is the sum of all the irreducible submodules of CGfc isomorphic to the representation p. If 
w G Wp, w' G Wp', and Wp ^ Wp' , then under the inner product [•, •] on CGk defined by 

[w,w'] = w{s)s, ^ w'{s)s] = ^ w(s)w'(s), 

it follows from the discussion in Chapter 2 of [22 that we have [w, w'] — 0. 

Now, suppose that p is the extension of p G IRR(Gfc) and that p' is the extension of p' G IRR(Gj). 
By Theorem IC-H when written in terms of the groupoid basis, v contains nonzero coefficients only for the 
elements [sj such that s G Dk, and v' contains nonzero coefficients only for the elements [sj such that 
s G Dj. Thus, ii k ^ j, we have {v,v') = 0. Suppose then that k = j. By Theorem IC.li we have 

VsGGfc / 
VsGGfc / 

for 5, a, 6', a' some idempotents in in Dk, and 

seGfc sSGfc 

some Fourier basis elements for CGk- 

li a ^ a' or 6 7^ 6', it is apparent that {v, v') = 0, so suppose further that a = a' and b = b'. 
Now, since p ^ p' and k — j, we have p ^ p' , and we therefore note that 

[E E «'W^] = o. 

sGGfc seGfc 

Now, we have 

{V,V') = E E ^'(s)^(LPbSPa"'j, bftiPa-'j) , 

seGfc teGfc 

and, since s,i G G^, [ptspa^^J = L-Pfc^Pa^^J if and only if s — t, so 

{v, v') = E ^(^)^ = [ E ^(^)' E ^'(^)] = 0- 

sSGfc seGfc sGGfc 

□ 
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